Syllable-based Thai duration model using multi-level linear regression and syllable accommodation
Authors
Abstract
This paper proposes a syllable-based Thai duration model using multi-level linear regression and syllable accommodation. To build a timing model that directly reflects control characteristics, we introduce two analysis results on hierarchical control characteristics. The first analysis showed that the syllable is highly correlated with higher-than-phone-level timing controls, whereas phone differences by themselves do not affect higher-level control and contribute only to local timing control. The second, on syllable accommodation, showed that phone duration depends strongly on local phone factors. These analysis results support the syllable-based hierarchical model proposed in this paper. Duration prediction experiments with 5-fold cross-validation yielded RMS errors of 46.73 and 32.37 ms, and correlations of 0.905 and 0.811 between measured and predicted durations, at the syllable and phone levels, respectively. A comparison of prediction precision showed that the proposed syllable-based multi-level duration model outperformed a conventional single-level phone duration model.
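The abstract does not give the model's features or equations, so the following is only a minimal sketch of what a two-level linear-regression duration model with 5-fold cross-validation could look like. All function names, the feature arrays, and the choice to pass the predicted syllable duration to the phone-level regression as an extra feature are assumptions made for illustration, not the authors' method. The synthetic data in the demo is random and unrelated to the reported results.

```python
import numpy as np

def fit_linear(X, y):
    """Ordinary least squares with an intercept; returns [bias, w1, w2, ...]."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

def predict_linear(coef, X):
    A = np.column_stack([np.ones(len(X)), X])
    return A @ coef

def rmse(y_true, y_pred):
    diff = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean(diff ** 2)))

def pearson(y_true, y_pred):
    return float(np.corrcoef(y_true, y_pred)[0, 1])

def two_level_cv(syl_X, syl_y, ph_X, ph_y, syl_of_phone, k=5, seed=0):
    """k-fold CV over syllables for a hypothetical two-level model.

    Level 1 predicts syllable duration from syllable-level features.
    Level 2 predicts phone duration from local phone features plus the
    predicted duration of the parent syllable (an assumed stand-in for
    the paper's syllable accommodation step).
    """
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(syl_y)), k)
    syl_pred = np.empty(len(syl_y))
    ph_pred = np.empty(len(ph_y))
    for i, test_syl in enumerate(folds):
        train_syl = np.concatenate([f for j, f in enumerate(folds) if j != i])
        # Level 1: syllable-level regression, fitted on training syllables only.
        syl_coef = fit_linear(syl_X[train_syl], syl_y[train_syl])
        syl_hat = predict_linear(syl_coef, syl_X)
        syl_pred[test_syl] = syl_hat[test_syl]
        # Level 2: phone-level regression conditioned on the syllable prediction.
        ph_feats = np.column_stack([ph_X, syl_hat[syl_of_phone]])
        train_ph = np.isin(syl_of_phone, train_syl)
        ph_coef = fit_linear(ph_feats[train_ph], ph_y[train_ph])
        ph_pred[~train_ph] = predict_linear(ph_coef, ph_feats[~train_ph])
    return {
        "syllable_rmse": rmse(syl_y, syl_pred),
        "syllable_corr": pearson(syl_y, syl_pred),
        "phone_rmse": rmse(ph_y, ph_pred),
        "phone_corr": pearson(ph_y, ph_pred),
    }

if __name__ == "__main__":
    # Synthetic demo data (hypothetical): 300 syllables, 2 phones each.
    rng = np.random.default_rng(42)
    n_syl = 300
    syl_X = rng.normal(size=(n_syl, 3))
    syl_y = 250 + syl_X @ np.array([40.0, -25.0, 10.0]) + rng.normal(scale=20, size=n_syl)
    syl_of_phone = np.repeat(np.arange(n_syl), 2)
    ph_X = rng.normal(size=(2 * n_syl, 2))
    ph_y = 0.5 * syl_y[syl_of_phone] + ph_X @ np.array([15.0, -8.0]) + rng.normal(scale=10, size=2 * n_syl)
    print(two_level_cv(syl_X, syl_y, ph_X, ph_y, syl_of_phone))
```

Folds are split by syllable rather than by phone so that no phone from a held-out syllable leaks into training. A single-level baseline for comparison would fit `fit_linear` on the phone features alone.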
Similar resources
Duration prediction using multi-level model for GPR-based speech synthesis
This paper introduces frame-based Gaussian process regression (GPR) into phone/syllable duration modeling for Thai speech synthesis. The GPR model is designed for predicting frame-level acoustic features using corresponding frame information, which includes relative position in each unit of utterance structure and linguistic information such as tone type and part of speech. Although the GPR-base...
Analysis and modeling of syllable duration for Thai speech synthesis
This paper describes analysis results on the control factors of Thai syllable duration, and a statistical control model using a linear regression technique. The analyses were carried out at both the syllable level and the phrase level. For syllable-level duration control, the effects of the five Thai tones and of syllable structures are investigated. To analyze syllable structure effects statis...
Combining Prediction by Partial Matching and Logistic Regression for Thai Word Segmentation
Word segmentation is an important part of many applications, including information retrieval, information filtering, document analysis, and text summarization. In the Thai language, the process is complicated because words are written continuously and their structures are not well defined. A recognized effective approach to word segmentation is Longest Matching, a dictionary-based method. Neverth...
Modeling Rhythmic Variation in Thai and its Application to Speech Synthesis
This study concerns a preliminary experiment on modeling the duration of Thai syllables. It is based on a corpus of minimal pairs of sentences that differ only in their stress patterns. Following a factor analysis of syllabic durations in the corpus, a simple duration model was developed. This model was used for re-synthesizing the utterances by manipulating speech from a Thai TTS system by adj...
A Study of Phoneme and Syllable Duration Characteristics of Mandarin Chinese
A multiple regression model was used to study the phoneme and syllable duration characteristics of Mandarin Chinese. The source speech material is a phonetically balanced text corpus collected from newspapers and spoken by a professional female announcer. Since the syllable, in an Initial/Final format, was adopted as a basic synthesis unit in our Chinese TTS system, the investigations were ta...
Journal title:
Volume, issue:
Pages: -
Publication date: 2007